NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Optimal Edge Caching for Individualized Demand Dynamics

https://doi.org/10.1109/TNET.2024.3369611

Quan, Guocong; Eryilmaz, Atilla; Shroff, Ness B (August 2024, IEEE/ACM Transactions on Networking)

Full Text Available
Minimizing Edge Caching Service Costs Through Regret-Optimal Online Learning

https://doi.org/10.1109/TNET.2024.3420758

Quan, Guocong; Eryilmaz, Atilla; Shroff, Ness B (January 2024, IEEE/ACM Transactions on Networking)

Full Text Available
Regret-Optimal Learning for Minimizing Edge Caching Service Costs

Quan, Guocong; Eryilmaz Atilla; and Shroff, Ness B (September 2023, IEEE WiOPT)
Prefetching and caching for minimizing service costs: Optimal and approximation strategies

https://doi.org/10.1016/j.peva.2020.102149

Quan, Guocong; Eryilmaz, Atilla; Tan, Jian; Shroff, Ness (January 2021, Performance Evaluation)
null (Ed.)
Full Text Available
A New Flexible Multi-flow LRU Cache Management Paradigm for Minimizing Misses

https://doi.org/10.1145/3376930.3376962

Quan, Guocong; Tan, Jian; Eryilmaz, Atilla; Shroff, Ness (December 2019, ACM SIGMETRICS Performance Evaluation Review)

Full Text Available
Counterintuitive Characteristics of Optimal Distributed LRU Caching Over Unreliable Channels

Quan, Guocong; Tan, Jian; Eryilmaz, Atilla (January 2019, Infocom)

Full Text Available
Asymptotic Miss Ratio of LRU Caching with Consistent Hashing

https://doi.org/10.1109/INFOCOM.2018.8485860

Ji, Kaiyi; Quan, Guocong; Tan, Jian (April 2018, IEEE INFOCOM)

To efficiently scale data caching infrastructure to support emerging big data applications, many caching systems rely on consistent hashing to group a large number of servers to form a cooperative cluster. These servers are organized together according to a random hash function. They jointly provide a unified but distributed hash table to serve swift and voluminous data item requests. Different from the single least-recently-used (LRU) server that has already been extensively studied, theoretically characterizing a cluster that consists of multiple LRU servers remains yet to be explored. These servers are not simply added together; the random hashing complicates the behavior. To this end, we derive the asymptotic miss ratio of data item requests on a LRU cluster with consistent hashing. We show that these individual cache spaces on different servers can be effectively viewed as if they could be pooled together to form a single virtual LRU cache space parametrized by an appropriate cache size. This equivalence can be established rigorously under the condition that the cache sizes of the individual servers are large. For typical data caching systems this condition is common. Our theoretical framework provides a convenient abstraction that can directly apply the results from the simpler single LRU cache to the more complex LRU cluster with consistent hashing.
more » « less
Full Text Available
LRU Caching with Dependent Competing Requests

https://doi.org/10.1109/INFOCOM.2018.8485891

Quan, Guocong; Ji, Kaiyi; Tan, Jian (April 2018, IEEE INFOCOM)

Caching systems using the Least Recently Used (LRU) principle have now become ubiquitous. A fundamental question for these systems is whether the cache space should be pooled together or divided to serve multiple flows of data item requests in order to minimize the miss probabilities. In this paper, we show that there is no straight yes or no answer to this question, depending on complex combinations of critical factors, including, e.g., request rates, overlapped data items across different request flows, data item popularities and their sizes. To this end, we characterize the performance of multiple flows of data item requests under resource pooling and separation for LRU caching when the cache size is large. Analytically, we show that it is asymptotically optimal to jointly serve multiple flows if their data item sizes and popularity distributions are similar and their arrival rates do not differ significantly; the self-organizing property of LRU caching automatically optimizes the resource allocation among them asymptotically. Otherwise, separating these flows could be better, e.g., when data sizes vary significantly. We also quantify critical points beyond which resource pooling is better than separation for each of the flows when the overlapped data items exceed certain levels. Technically, for a broad class of heavy-tailed distributions we derive the asymptotic miss probabilities of multiple flows of requests with varying data item sizes in a shared LRU cache space. It also validates the characteristic time approximation under
more » « less
Full Text Available
On Resource Pooling and Separation for LRU Caching

https://doi.org/10.1145/3179408

Tan, Jian; Quan, Guocong; Ji, Kaiyi; Shroff, Ness (April 2018, Proceedings of the ACM on Measurement and Analysis of Computing Systems)

Full Text Available

Search for: All records